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Method of Content Identification, Devjcs^ndJSoftwarp 



The invention relates to a method of content identification, comprising the step 
of creating a first signature for a first content item comprising a first sequence of frames. 

The invention further relates to an electronic device comprising an interface 
for interfacing with a storage means storing a first signature of a first content item, the first 
content item comprising a first sequence of frames; a receiver able to receive a signal 
comprising a second content item, the second content item comprising a second sequence of 
frames; and a control unit able to use the interface to retrieve the first signature from the 
storage means, able to create a second signature for the second content item, and able to 
determine similarity between the first signature and the second signature. 

The invention further relates to software enabling upon its execution a 
programmable device to function as an electronic device. 



An embodiment of the method is known from EP 0 248 533. The known 
method performs real-time continuous pattern recognition of broadcast segments by 
constructing a digital signature from a known specimen of a segment, which is to be 
recognized. The signature is constructed by digitally parameterizing the segment, selecting 
portions among random frame locations throughout the segment in accordance with a set of 
predefined rules to form the signature, and associating with the signature the frame locations 
of the portions. The known method is claimed to be able to identify large numbers of 
commercials in an efficient and economic manner in real time, without resorting to expensive 
parallel processing or to the most powerful computers. 

As a drawback of the known method, it can only be executed in real time in an 
economic manner if the number of random frame locations is limited. Unfortunately, limiting 
the number of frame locations also limits the reliability of the pattern recognition. 
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It is a first object of the invention to provide a method of the kind described in 
the opening paragraph, which can be executed in real time in an economic manner while 
achieving a relatively high reliability of pattern recognition. 

It is a second object of the invention to provide an electronic device of the 
5 kind described in the opening paragraph, which is able to perform real-time pattern 
recognition with a relatively high reliability. 

It is a third object of the invention to provide software of the kind described in 
the opening paragraph, which can be executed in real time in an economic manner while 
achieving a relatively high reliability of pattern recognition. 

10 



According to the invention the first object is realized in that the step of 
creating the first signature comprises creating a first sub-signature to comprise a first 
sequence of first averages, a first average being stricken of values of a feature in multiple 

1 5 frames in the first sequence of frames. A feature may be, for example, frame luminance, 

frame complexity, Mean Absolute Difference (MAD) error as used by MPEG2 encoders, or 
scale factor as used by MPEG audio encoders. A frame may be an audio frame, a video 
frame, or a synchronized audio and video frame. 

An embodiment of the method of the invention further comprises the step of 

20 creating a second signature for a second content item comprising a second sequence of 

frames; in which the step of creating the second signature comprises creating a second sub- 
signature to comprise a second sequence of second averages, a second average being stricken 
of values of the feature in multiple frames in the second sequence of frames. The 
embodiment further comprises the step of determining similarity between the first and the 

25 second signature; and said step of determining similarity between the first and the second 
signature comprises determining similarity between the first and the second sub-signature. 

Similarity between the first and the second signature may be used to identify a 
short audio/video sequence in other streams. For real-time comparison of tens or even 
hundreds of signatures, computational efforts must be low. A signature of new content may 

30 be generated and compared to a database of signatures every N frames. Comparing signatures 
every frame will be computationally too intensive and even unnecessarily accurate in time. 
The signatures must be robust to noise and other distortions because a Personal Video 
Recorder-like device could have many different input sources ranging from high quality 
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digital video data to low quality analogue cable or VHS signals. By averaging over multiple 
frames, the effects of noise and other distortions are reduced. 

In an embodiment of the method of the invention, the step of determining 
similarity between the first and the second signature comprises calculating a coefficient of 
correlation between the first and the second signature and comparing the coefficient with a 
threshold. By averaging over multiple frames, a data set with a more or less normal 
distribution is obtained. The degree of normality of the distribution depends on the amount of 
frames being averaged. A good measure of similarity can be obtained by correlating two data 
sets with a normal distribution, e.g. using Pearson's correlation. Alternatively, a first average 
of a sequence of feature values could be subtracted from a second average of a sequence of 
feature values to obtain a different similarity measure. By comparing a similarity measure 
with a threshold, a positive or negative identification can be obtained, which can be the basis 
for further steps. 

The step of determining similarity between the first and the second signature 
may comprise calculating a coefficient of correlation between a first sub-sequence at a 
position in the first sequence of averages and multiple second sub-sequences in the 
neighborhood of a corresponding position in the second sequence of averages. This reduces 
the time-shifting problem, where, for instance, a missing frame in a content item might lead 
to a negative identification. Frames may be lost when displaying older VHS source material. 
Sometimes, the vertical synchronization is missed, resulting in lost frames. The time-shifting 
problem may also occur when a signature is not created every frame, but every plurality of 
frames. 

The coefficient of correlation between the first sub-sequence and the multiple 
second sub-sequences may be calculated by using weights, a weight being larger if a second 
sub-sequence is near the corresponding position and smaller if a second sub-sequence is 
remote from the corresponding position. Since time shifts between similar content items will 
more likely be minor than major, correlation is more likely to be accidental if the second 
element is remote from the corresponding position. Better identification can be achieved by 
using weights. 

The step of creating a signature may comprise creating multiple sub- 
signatures, and similarity between the first and the second signature is determined by using 
the multiple sub-signatures. Although one sub-signature per signature may be sufficient in 
some instances, the combinatorial behavior of low-level AV features of a short video 
sequence is more likely to be unique to this sequence. The uniqueness of a signature 
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comprising multiple sub-signatures depends on the amount of information it represents. The 
longer the feature sequences, the more unique the signature can be. Also, the more different 
types of features are used simultaneously, and thus the more sub-signatures, the more unique 
the signature can be. Due to the uniqueness of a signature, a large number of signatures can 
be uniquely identified under a variety of conditions using a single, pre-defined, identification 
criterion. In case a service provider provides the signatures, the identification criterion could 
in principle be designed per signature. This is because the service provider is able to test 
identification criteria for a signature on a large amount of content beforehand. However, in 
case of signatures defined by a user, a single, pre-defined, identification criterion should 
suffice for all signatures. 

Creating a sub-signature may comprise reducing the number of averages. This 
reduces the required amount of processing. Since feature values are averaged, sub-signatures 
can be sub-sampled without losing significant information. Large differences between values 
are more significant than small differences. Since differences between average feature values 
will be smaller than differences between feature values, the amount of average feature values 
can be smaller than the amount of feature values. 

If the second content item is comprised in a third content item and the first and 
the second signature are similar, a further step may comprise skipping the second content 
item in the third content item. For instance, a signature could be made for an intro of a 
commercial block. Whenever the intro is identified, 3 minutes could be skipped. 
Alternatively, a signature could be made for a black or blue screen that is shown when no 
signal is present. The skipping could be done automatically or the user could press a button to 
skip a given amount of content. 

A further step may comprise identifying boundaries between a first segment 
and a second segment of a third content item, and another step may comprise skipping the 
first segment in the third content item if the second content item comprises the first segment 
and the first and the second signature are similar. The first segment may be, for instance, a 
commercial. The second segment may be, for instance, another commercial or a part of a 
movie. The segments of commercial blocks can be identified by using more general 
discriminators and separators in the A/V domain. Segments that are inside a commercial 
block can be detected reliably and even the boundaries between segments can be identified. 
The signatures of detected segments can be stored in a database. New incoming content can 
be correlated in real-time with the existing signatures of segments in the database and if the 
correlation is high enough, the content will be tagged as commercial segment. Due to the fact 
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that segments of commercial blocks are of a repetitive nature and vary in their position inside 
a commercial block, there is a good chance to learn reliable signatures of unknown 
commercials. With this method, the precision of a commercial block detector can be 
increased significantly. 

A further step may comprise recording the second content item if the first and 
the second signature are similar. If the first signature was made for an intro of a comedy 
series, a Personal Video Recorder (PVR) using the method of the invention may start 
recording as soon as the first and the second signature are found to be similar. Recording may 
also be started in retroaction, using a time-shift mechanism. This is useful when the generic 
intro of a series is not at the beginning of the program. The first signature, a recording start- 
time and end-time relative to the position of the first sequence of frames in the first content 
item, and a set of channels to scan for the second signature could be given by the user or 
downloaded from a service provider. The method of the invention may also be used to search 
for a second signature in a database, retrieve the accompanying second content item from the 
database, and store the second content item. 

A further step may comprise generating an alert if the first and the second 
signature are similar. A PVR using the method of the invention may alert a user by showing 
the content of interest in a Picture In Picture (PIP) window, with an icon and/or sound. The 
user could then decide to switch to the identified content by pressing a button on the remote 
control or to remove the alert. When the user switches to the identified content, he or she 
could start watching the identified content live or play, in retroaction, from the beginning of 
the content, using a time-shift mechanism. 

According to the invention the second object is realized in that the control unit 
is able to create a first sub-signature from the first signature, the first sub-signature 
comprising a first sequence of averages of values of a feature in multiple frames in the first 
sequence of frames; to create a second sub-signature for the second signature by averaging 
values of the feature in multiple frames in the second sequence of frames; to determine 
similarity between the first and the second sub-signature; and to determine similarity between 
the first and the second signature in dependence upon the similarity between the first and the 
second sub-signature. The device of the invention may be a Personal Video Recorder (PVR), 
a digital TV, or a satellite receiver. The control unit may be a microprocessor. The interface 
may be a memory bus, an IDE interface, or an IEEE 1394 interface. The interface may have 
an internal or an external connector. The storage means may be an internal hard disk or an 
external device. The external device may be located at the site of a service provider. 
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In an embodiment of the device of the invention, the control unit is able to 
determine similarity between the first and the second signature by calculating a coefficient of 
correlation between the first and the second signature and comparing the coefficient with a 
threshold. 

If the second content item is comprised in a third content item and the first and 
the second signature are similar, the control unit may be able to urge a further storage means 
to store the third content item without the second content item. 

The control unit may be able to urge a further storage means to store the 
second content item if the first and the second signature are similar. 

The control unit may be able to generate an alert if the first and the second 
signature are similar. 

According to the invention the third object is realized in that the software 
comprises a function for creating a signature for a content item comprising a sequence of 
frames, the function comprising creating a sub-signature to comprise a sequence of averages, 
an average being stricken of values of a feature in multiple frames in the sequence of frames. 

An embodiment of the software of the invention further comprises a function 
for determining similarity between two signatures by calculating a coefficient of correlation 
between the two signatures and comparing the coefficient with a threshold. 

The software may be stored on an record carrier, such as a magnetic info- 
carrier, e.g. a floppy disk, or an optical info-carrier, e.g. a CD. 



These and other aspects of the method and device of the invention will be 
further elucidated and described with reference to the drawings, in which: 

Fig.l is a flow chart of a favorable embodiment of the method; 

Fig.2 is a flow chart detailing a first and a second step of Fig. 1; 

Fig. 3 is a flow chart detailing a third step of Fig.l; 

Fig.4 is a block diagram of an embodiment of the electronic device; 

Fig.5 is a schematic representation of two steps of Fig.2; 

Fig.6 is a schematic representation of a variation of the two steps of Fig.5; 

Corresponding elements within the drawings are denoted by the same 
reference numerals. 
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The method of Fig. 1 comprises a step 2 of creating a first signature for a first 
content item comprising a first sequence of frames. Step 2 comprises creating a first sub- 
signature to comprise a first sequence of first averages, a first average being stricken of 
values of a feature in multiple frames in the first sequence of frames. 

5 The method of Fig. 1 may further comprise a step 4 of creating a second 

signature for a second content item comprising a second sequence of frames and a step 6 of 
determining similarity between the first and the second signature. Step 4 comprises creating a 
second sub-signature to comprise a second sequence of second averages, a second average 
being stricken of values of the feature in multiple frames in the second sequence of frames. 

10 Step 6 comprises determining similarity between the first and the second sub-signature. 

Steps 2 and 4 may comprise creating multiple sub-signatures, and similarity 
between the first and the second signature may be determined by using the multiple sub- 
signatures. 

If the second content item is comprised in a third content item and the first and 
15 the second signature are similar, an optional step 8 allows skipping the second content item in 
the third content item. A further step may comprise identifying boundaries between a first 
segment and a second segment of a third content item. Optional step 10 allows skipping the 
first segment in the third content item if the second content item comprises the first segment 
and the first and the second signature are similar. Optional step 12 allows recording the 
20 second content item if the first and the second signature are similar. Optional step 14 allows 
generating an alert if the first and the second signature are similar. 

Steps 2 and 4 shown in Fig.l may both be subdivided into three steps, see 
Fig.2. Step 22, see also Fig.5, creates a sequence featureSeq(j,k) of feature values from a 
feature Ij in multiple frames of a sequence of frames, k is a unique identifier for the sequence 
25 of frames. Content(k) is the content item comprising the sequence of frames. Time(k) is the 
time instance of the last frame of the sequence of frames expressed as a frame number in 
content(k). Feature (C, p, j) is the value of feature Ij at time instance p in content item C. The 
sequence of feature values will have length L. 



30 



featureSeq(y,/r) 

= [feature(content(£), time(fc) -L + 1J) ... feature(content(fc), time(fc) , j)] 



WO 2004/019527 



PCT/IB2003/003289 



8 



Step 24, see also Fig.5, creates a first sub-signature using the sequence of feature values. The 
sequence of feature values is window-mean filtered with a filter window length of F frames 
using the following function: 



By using the filter function, the problem of noise and distortions is reduced. Due to varying 
signal conditions or encoding conditions, the feature sequences can be distorted in multiple 
ways. Distortions could lead to a missed or a false identification of a video sequence. 



sequence of feature values is window-mean filtered, it could be sub-sampled without losing 
significant information. Sub-sampling every F/2 period has the advantage that the total 
number of data points in the signature decreases by a factor F/2 and thus makes it possible to 
compare more signatures simultaneously, r is the sub-sampling rate, the default value is F/2 
assuming even F. K is the number of samples in the sub-sampled filtered sequence. K is a 
natural number that is rounded down if L-F+l is not an integral multiple of r. 



Sub-signature (j, k) is the sub-sampled and filtered sequence of feature values in content(k) in 
the filter window at time(k) for feature Ij: 

sub - signatu re(y , k) = [filter(y, k, r) filter(y, k y 2r) • • ■ filter(y, k 9 Kr)] 

Steps 22 and 24 may be repeated several times to create multiple sub-signatures for multiple 
features. Step 26 creates the first signature using the sub-signatures created in step 24. A 
signature consists of M sub-signatures: 



Under general conditions, the proposed signature can be generated very efficiently during 
online operations. Every Nth frame, a new signature(knew) of received or stored content can 
be made. The first time, a complete signature(koid) must be made. However, after that, a new 
signature(k nC w) can easily be created by using the N new frames. Sub-signature 0> k new,k o id) 



1 F 

filter(y,*,/>) = — XfeatureSeq(y,*) 



Step 24 reduces the number of averages by using sub-sampling. Because a 




signature(k) = sub - signature T (l,k) • • • sub - signature' (M, k) 
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equals sub-signature (j,kn ew ) if N is a multiple of the sub-sampling rate r. Content (k new ) 
comprises content (kow) and time(k new ) == time(koid) + N. 



In step 82 shown in Fig. 6, FeatureSeq (j, k new , koid) creates an updated 
5 sequence of feature values from a feature Ij in multiple frames in an updated sequence of 
frames: 



newFeatureSeq(y, k) 

= [feature(content(fc), time(fc) -N+l,j) ... feature(content(fc), time(/r) , j)] 
featureSeq(y,* new ,A: oW ) = 

= [featureSeqO,A: <>w ) Af+l ••■ featureSeq(y,fc oM ) i newFeatureSeq(y,* Bew )] 

10 Filter (j, k new , ko W ,p) is the updated filter function for a feature Ij in multiple frames in the 
updated sequence of frames: 



filterOA w » = 



filterO-,* oW ,/>), pZL-F-N + l 

i£featureSeq(M ne H.>* oW ) p+m . 1 , otherwise 



m=l 



15 Filter QX>\d,p) is pre-calculated. If N is an exact multiple of the sub-sampling rate r, then 
Z=N/r and sub-signature (j»k ne w,koid)» see step 84, is the updated sub-sampled filtered 
sequence. Sub-signature (j, kow) is pre-calculated. 



sub - signature (/, k„ ew , k oUt ) = 

[sub -signature {j>k 0 ld)z+\ "' sub - signature [j 9 k Q \£ ) x. «»0\*» >*otf ~ Z + - filtcr O*> , k M , Kr)\ 

20 

Step 6 shown in Fig.l determining similarity between the first and the second signature may 
be subdivided into six steps in a favorable embodiment, see Fig.3. In the favorable 
embodiment, sub-signatures are not compared as a whole but small sliding window 
sequences, called context windows, are compared instead. Using context windows solves the 
25 problem of shifts in timing between two similar or even equal sub-signatures. These shifts 
can occur because a signature is compared only every N frame. Using context windows also 
solves the problem of local shifts in the sequence due to missing or inserted frames. Although 
comparing the Fourier-power spectra of the sub-signatures may also solve this problem, 
because the power spectrum is invariant to shifts, differences at the borders of the sub- 
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signatures could result in differences in the power spectra. Furthermore, computational 
efforts of this solution might be much higher. 

Step 42 creates context windows for the first and the second signatures created 
in steps 4 and 6 shown in Fig.l . Context windows are created for each value in each sub- 
signature in both signatures and comprise multiple values from a sub-signature around a 
position in the sub-signature. The Matrix of context windows for a sub-signature(j,ki): 
sub - signatureO , k x ) x — sub - signature^, k x ) w 



CW(j,k x ) = 



sub - signatureO', k x ) K -w+\ * • • sub - signatureO*,*, ) K 



«w r (M)i 



Step 44 calculates the correlation between each context window in a first sub-signature and 
10 each context window in a second sub-signature. The calculation comprises creating 
normalized context windows and calculating contextCorr(j,ki,k 2 ,pi,p2): 



ncw r {J 9 k x ,/>) = < 



cw r (/. fr),)- mean(cw r (J 9 k x ) p ) 



[Not A Number (NaN)] 
ncw T (j\k }9 p) 



std(cw T U,k t ) p )*0 
std(cw r O\*.)|.)=0 



ncw T U>k l9 K-W + l) 

new T (J. k x , p x )new(j\ k 2> p 2 ) std(ncw r (j\ k x ,p x j)*0* 



contextCorr(y , k Xi k 29 p X9 p 2 ) = 



W-\ 
NaN, 



std(ncw r O,*2>P2))^° 
otherwise 



15 The proposed similarity measure is based on correlation. Correlation can always be 

consistently scaled between -1 and 1, independent of the mean and variance of the signatures. 
Consequently, correlation is also more robust to distortions than, for instance, the Mean 
Square Error. Context correlation is undefined if one of the window sequences is constant. 
Although another measure could be defined if one of the context window standard deviations 

20 is zero, this will make the overall signature similarity measure inconsistent. Thus, effectively 
only the non-constant parts are compared, which has the disadvantage that the comparison is 
less strict. Increasing the context window width can increase the number of non-constant 
parts; this, however, increases the computational load. Step 44 is repeated for each first sub- 
signature and each second sub-signature created for the same feature. 
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Step 46 calculates a coefficient of correlation contextSim(j>ki,k 2 ,p) between a 
context window at position p in the first sub-signature and multiple context windows in the 
second sub-signature. The final context window similarity at position p in sub-signature(j,ki) 
with the context window at a corresponding position p in sub-signature(j,k 2 ) is defined as the 
5 best context correlation with the context window at neighborhood positions p-Ln to p+ U of 
sub-signature (j>k 2 ). U is the neighborhood radius. Q(j,ki,k 2 ,p) is a set of positions from sub- 
signature (j,k 2 ), the positions being in the neighborhood of position p from sub-signature (j, 
ki): 

1 0 QU, k l9 k 2 ,p) = {q: {max{/> - L n ,l}, • ♦ • , min{/> + L n , K - W + l}}|contextCorr(y , k l9 k 2 , p, q) * NaN} 



20 



contextSim(y , k , * ,p) ■ < 



max (contextCorr(y , ^ , k 2 , p, q)\ Q(j y k^k^p)*® 

qeQU>k v k 2> p) 

NaN, QU,k v k r p) = 0 



1 5 Step 46 is repeated for each first sub-signature and each second sub-signature created for the 
same feature. 

Step 48 calculates a coefficient of correlation subSigSim(j,ki,k 2 ) between a 
first sub-signature (j, ki) and a second sub-signature (j, k 2 ) 



P(j>K> k i) = {p : { 1 >-> K ~ w +lJccmtexlSimC/ i * |t t 2t p) * NaN} 

1 



subSigSim(y, k x , k 2 ) = « 



contextSim(7 , k }y k 2 , />), P(j, k t ,k 2 )*0 



\PUA> k l)\ peP(j^k 2 ) 

NaN, P{j,k x ,k 2 ) = 0 



As shown above, the complete sub-signature similarity is defined by the average context 
similarities that are defined. If all context windows are constant, the sub-signature similarity 
is not defined. Finally, the complete signature similarity is defined as the average of defined 
25 sub-signature similarities. Step 48 is repeated for each first sub-signature and each second 
sub-signature created for the same feature. 

Step 50 calculates a coefficient of correlation signatureSim(ki,k 2 ) between the 
first and the second signature. 



WO 2004/019527 PCT/IB2003/003289 

12 



J(JA» k i) = {f : {l,»,Af]|subSigSiin(y > * 1 ,A 2 ) * NaN} 



signatures im^, ,k 2 ) = 



. f — J — r-77 £subsi g sim(y,* p * 2 ) I JUAM*® 

2 { \Aj 9 k l9 k 2 )\ Je j Utklt k 2) J 

NaN, J(j>KM = ® 



The signature similarity is scaled such that its range is from zero to one, although this is not 
necessary. Note that, in extreme situations, the signature similarity can be undefined if one or 

5 both of the signatures are completely constant. 

Step 52 compares the coefficient with a threshold. When the coefficient is 
higher than the threshold, the first and the second signature and hence the first and second 
content item, e.g. audio/video sequences, can be identified as being equal. When the 
signatures are too simple, i.e. not specific enough, a good threshold will not exist. There are 

10 multiple signature generation parameters that can be varied to increase the specificity of the 
signatures. Identification quality could be further improved by generating multiple signatures 
for an audio/video sequence at multiple time instances, for instance, at time(k), time(k)+G, 
time(k)+2G, etc. In order to identify the sequence, a large percentage of the generated 
signatures should be positively identified. This improves the robustness and quality of the 

15 identification mechanism. 

Weights may be used in step 46 to calculate the coefficient of correlation 
contextSim(j,ki,k 2 ,p) at position p in the first sub-signature and multiple context windows in 
the second sub-signature of the second signature, a weight being larger if a context window 
in the second sub-signature is near the corresponding position p and smaller if the second 

20 element is remote from the corresponding position p. ContextSim(j,ki,k 2 ,p) is redefined to 
incorporate a weight w(p,q): 



QU\ k i > k 2 > P) = fe : K " w + l)contextConO\ k x , k 2 , p 9 q) * NaN} 
contextSimO* , k x , k 2 , p) = < 



max (h<P, q) contextCorr(y , *, , k 2 , p, q)\ Q(j, ki^p)*® 

NaN, e(Mi,*2,P) = 0 



25 



The weight function w(p,q) is a block function if all context windows in the second sub- 
signature that are in the neighborhood of the corresponding position p have equal weight. 
With this weight function, the original formulation as previously defined is preserved: 
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max{p - L n ,l} < q <> mm{p + L n , K - W + 1} 
otherwise 



The weight function w(p,q) is a triangular function if a weight is used in such a way that 
context windows further from corresponding position p are less important: 



w 



— + max{p - L w ,l} < q < min{/> + L w ,K-W + \} 



0, 



otherwise 



2Lw is the triangle base length. 



Similarity can be evaluated efficiently during online operations. Every N 
frame, a new signature of received or stored content is made and compared with multiple 
reference signatures. For each reference sub-signature(j,ki), a context correlation matrix 
CCG,ki,k 2 ) is maintained, containing the context correlation of each context window of sub- 
signature(j,kl) with all context windows in sub-signature^^). 

CCO', k x ,k 2 ) = [ccC/\ *i , * 2 )i • - CC U> k \ M ) K-W+\ ] = 

contextCorrO*,^i.^2»W) contextCorrO", *, ,k 2 ,1, K - W + 1) 

contexteorrO-,* 1> * 2 ,/e-0' + 1,1) - amte*Con(y, 1^ + 1,*- IF + 1). 



A context similarity matrix is calculated by using neighborhood-weighting matrix W: 



W = 



w 



(1,1) 



w{l,K-W + l) - v/{K-W + l,K-W + l) 



The context similarity matrix: 



CS(y, k x ,k 2 ) = [contextSim(y , £, , k 2 ,1) • • • contextSimO' , k x , k 2 , K - W + 1)] 
= max(W.*CC(./,* I ,* 2 )) 
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The matrix max(A) operation finds the maximum per column of A . All NaN elements of A 
are discarded from the maximum operation. If all elements of a column are NaN, the 
maximum value for that column is NaN. The * . * * operator is the element-wise matrix 
multiplication operator. SubSigSim(j,ki,k 2 ) and signatureSim(ki Jc 2 ) can be calculated by 

5 using the context similarity matrix. 

Because an updated signature(k 2ne w) where time(k 2n ew) minus time(k 2oW ) 
equals N only contains Z (=N/r) new values at the end of the sub-signatures, only Z new 
normalized context windows are calculated. For the Z new context windows in sub- 
signature(j»k 2ne w), the context correlation with the (K-W+l) context windows of sub- 

10 signature(j,ki) is calculated. These correlation values are used to update the context 

correlation matrix CC(j,ki,k 2 ):= CC(j, k u k 2ne w). The Z new normalized context windows in 
sub-signature (j,ki): 

"new 7 " 0\*2 > K ~W + \-(Z- 1))" 
newNCW(y,fc 2 )= : 

The new context correlation matrix: 

15 CCO',*,,*^,*^) 

= [cc(y,*,,* 2oW )z + i - cc (J> k i> k 2 M )at-^i-z InewCCO,*,,^)] 



It is assumed that any linear operation with a NaN results in a NaN. Thus, if 
one or both of the normalized context windows is constant, the resulting context correlation 
is NaN. By using the updated context correlation matrices, all the new similarities can be 
20 calculated. 

The electronic device 62 of Fig. 4 comprises an interface 64 for interfacing 
with a storage means 66 storing a first signature of a first content item, the first content item 
comprising a first sequence of frames. The device 62 further comprises a receiver 68 able to 
receive a signal comprising a second content item, the second content item comprising a 
25 second sequence of frames. The device 62 also comprises a control unit 70 able to use the 
interface 64 to retrieve the first signature from the storage means 66, able to create a second 
signature for the second content item, and able to determine similarity between the first 
signature and the second signature. The control unit 70 is able to create a first sub-signature 
from the first signature, the first sub-signature comprising a first sequence of averages of 
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values of a feature in multiple frames in the first sequence of frames. The first sub-signature 
may be extracted from the first signature or, if the first signature comprises raw data, e.g. a 
sequence of feature values, the first sub-signature may be calculated in the same way as the 
second sub-signature. The first signature may also need to be processed in other ways to 
5 create the first sub-signature. The control unit 70 is able to create a second sub-signature for 
the second signature by averaging values of the feature in multiple frames in the second 
sequence of frames. The control unit 70 is able to determine similarity between the first and 
the second sub-signature. The control unit 70 is able to determine similarity between the first 
and the second signature in dependence upon the similarity between the first and the second 

10 sub-signature. The storage means 66 may be comprised in the device 62 or may be an 

external device. The storage means 66 may comprise, for example, a hard disk or an optical 
storage medium. The receiver 68 may receive a signal using cable 76. The receiver 68 may 
receive, for example, signals from a cable operator or from a satellite dish. 

The control unit 70 may be able to determine similarity between the first and 

1 5 the second signature by calculating a coefficient of correlation between the first and the 

second signature and comparing the coefficient with a threshold. If the second content item is 
comprised in a third content item and the first and the second signature are similar, the 
control unit 70 may be able to urge a further storage means 72 to store the third content item 
without the second content item. The control unit 70 may be able to urge a further storage 

20 means 72 to store the second content item if the first and the second signature are similar. 
The further storage means 72 may be comprised in the device 62 or may be an external 
device. The further storage means 72 may comprise, for example, a hard disk or an optical 
storage medium. The further storage means 72 and the storage means 66 may be physically 
or logically different parts of the same hardware. The control unit 70 may be able to use a 

25 further interface 78 to retrieve data from the further storage means 72. The interface 64 and 
the further interface 78 may be physically or logically different parts of the same hardware. 

The control unit 70 may be able to generate an alert if the first and the second 
signature are similar. The alert may be displayed by using a display 74. The alert may also be 
audible. If the device 62 is a Digital TV, the display 74 may be comprised in the device 62. If 

30 the device 62 is a Personal Video Recorder, the display 74 may be an external device. The 
display 74 may be, for example, a CRT, a LCD, or a Plasma display. The user may be 
responsible for initiating the creation of the first signature. He or she could press a generate 
signature' button on a remote control of a PVR at the moment when a generic intro of a 
program is shown. After the button is pressed, the PVR could ask the user what to do when 
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the first signature and the second signature are similar. If the user wants the program to be 
recorded, he or she may be able to specify the relative recording start time and end time but 
also a set of channels to scan. For instance, -3 min. 00 sec to +30 min 00 sec on ABC, CBS, 
and NBC. If a user wants to be alerted, he or she may be able to specify a set of channels to 
scan. The user may also be able to indicate that an occurrence of a similar signature is to be 
stored in a database enabling a user to jump to content or to skip content during playback. 

The PVR may also be able to search for a second signature similar to the first 
signature in a collection of stored content and play back the second content item if the second 
signature is found. In this way, a user could jump from the start of one stored episode to the 
start of another stored episode of the same series. Another way to jump is to have predefined 
signatures. A user may be able to select a specific first signature from a list of signatures. 
With a button-press, the user can jump to the next instance of an intro. Instead of using a list, 
a small set of signatures could be programmed by the user on the remote control. If a user 
always likes to watch a specific news show or a specific TV comedy, he or she could 
program generic buttons on the remote control to link to these programs using the predefined 
signatures. If a user is playing back stored content and presses the generic button that links to 
the specific news show, the PVR will jump to a next identified intro of the specific news 
show. If the button is pressed again, the PVR will jump again to a next identified intro. The 
first and the second signature may be compared while the second content item is being stored 
in the collection of stored content. 

While the invention has been described in connection with preferred 
embodiments, it will be understood that modifications thereof within the principles outlined 
above will be evident to those skilled in the art, and thus the invention is not limited to the 
preferred embodiments but is intended to encompass such modifications. The invention 
resides in each and every novel characteristic feature and each and every combination of 
characteristic features. Reference numerals in the claims do not limit their protective scope. 
Use of the verb "to comprise" and its conjugations does not exclude the presence of elements 
other than those stated in the claims. Use of the article "a" or "an" preceding an element does 
not exclude the presence of a plurality of such elements. 

'Means', as will be apparent to a person skilled in the art, are meant to include 
any hardware (such as separate or integrated circuits or electronic elements) or software 
(such as programs or parts of programs) which perform in operation or are designed to 
perform a specified function, be it solely or in conjunction with other functions, be it in 
isolation or in co-operation with other elements. The invention can be implemented by means 
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of hardware comprising several distinct elements, and by means of a suitably programmed 
computer. In the device claim enumerating several means, several of these means can be 
embodied by one and the same item of hardware. 'Software' is to be understood to mean any 
software product stored on a computer-readable medium, such as a floppy disk, 
5 downloadable via a network, such as the Internet, or marketable in any other manner. 



